Abstract 


Necessary and sufficient conditions for identifiability of the
diffusion coefficient in Galerkin approximations to a two point
boundary value problem are derived for various choices of Galerkin
subspaces. The results are further used to investigate output least squares
identifiability and output least squares stability of the diffusion
coefficient.


Key Words: Parameter estimation, stability of output least squares
problem, boundary value problem.


Identifiability  Under  Approximation  for  an 
Elliptic  Boundary  Value  Problem 


1.  Introduction 

In this note we study the following boundary value problem:

(1.1)  $-(a u_x)_x + c u = f$ in $(0,1)$,  $u_x(0) = u_x(1) = 0$.

Let $I = (0,1)$ and $f \in L^2(I)$. Recall that if $a \in H^1(I)$ with $a(x) \ge \alpha > 0$
and $c \in L^2(I)$ with $c(x) \ge \zeta > 0$ a.e., then there exists a
unique solution $u = u(a)$ of (1.1) in $H^2(I)$. We are concerned with
the identification of the coefficient $a$, given information on the solution
$u(a)$, and in particular we will study the injectivity of the mapping
$a^M \to u^N(a^M)$, where $a^M$ is some approximation to $a$ and $u^N$ an approximation
to $u$. We first describe the problem in a more general context.

Let $\mathcal{C}: H^2(I) \to Z$ be a continuous linear operator from the solution
space to the observation space $Z$, describing the type of information
available on the state $u$. To determine the coefficient corresponding
to an observation $z \in Z$ of the system that is modelled by (1.1), the
following output least squares formulation is frequently used:

(1.2)  minimize $|\mathcal{C}u(a) - z|_Z^2$ over $Q_{ad}$.

Here the set $Q_{ad}$ of admissible parameters is chosen such that the
existence of a solution of (1.2) is guaranteed.


For example, if $n$ observations taken at the points $\{x_k\}_{k=1}^n$
are available, we may take $Z = \mathbb{R}^n$ with $\mathcal{C}: H^2(I) \to \mathbb{R}^n$ defined
by $\mathcal{C}u = \{u(x_k)\}_{k=1}^n$. In this case (1.2) becomes

(1.3)  minimize $\sum_{k=1}^n |u(x_k; a) - z_k|^2$ over $Q_{ad}$.

Alternatively one might have distributed observations $z \in L^2(I)$, or,
using the data $\{z_k\}_{k=1}^n$ at $\{x_k\}_{k=1}^n$, one might want to obtain a function
$z \in L^2(I)$ either by interpolation or least squares regression; for $\mathcal{C}$
we would then take $\mathcal{C}u = u$ and the optimization problem becomes

(1.4)  minimize $|u(a) - z|_{L^2}^2$ over $Q_{ad}$.

In either case an appropriate choice for $Q_{ad}$ would be

$Q_{ad} = \{a \in H^1 : a(x) \ge \alpha > 0,\ |a|_{H^1} \le \gamma\}$,

with $\gamma > \alpha$, [6].

Defining the attainable set $\mathcal{V} = \{\mathcal{C}u(a) : a \in Q_{ad}\}$, one may view
the optimization problem (1.2) as having two parts:

(i)  given $z \in Z$ find $z_{proj}$, the projection of $z$ on $\mathcal{V}$,

(ii)  given $z_{proj}$ find $a \in Q_{ad}$ such that $\mathcal{C}u(a) = z_{proj}$.

Assuming the existence of $z_{proj}$, the uniqueness of $z_{proj}$ depends
on the geometry of $\mathcal{V}$. In (ii) there exists an $\bar a$ such that $\mathcal{C}u(\bar a) = z_{proj}$
by definition of $Q_{ad}$. The question of uniqueness of such an $\bar a$
arises, and it is guaranteed if $\Phi: a \to \mathcal{C}u(a)$ is injective at $\bar a$.


Injectivity of $\Phi$ at $\bar a$ is called identifiability of $a$ at $\bar a$. The
above mentioned uniqueness problems are rather involved in general; see,
e.g., [2; 7, Appendix], and [11] for a hyperbolic equation.

When solving (1.2) on a computer it is necessary to replace (1.1)-(1.2)
by a finite dimensional problem. This is done by approximating both
the solutions of (1.1) and the set $Q_{ad}$ by functions from finite dimensional
function spaces. A finite dimensional version of the minimization
problem (1.2) is then solved to obtain an estimate for the unknown
coefficient $a$ (compare, e.g., [1], [5]). Again the existence and uniqueness
questions analogous to the two steps (i) and (ii) above can be considered.

The main purpose of this investigation is the study of uniqueness
for the finite dimensional analog of (ii). If for a chosen approximation
of $a$ by $a^M$ the mapping $a^M \to u^N(a^M)$ is injective at $\bar a^M$, then $a$ is
called identifiable under approximation at $\bar a^M$. The related question
for parabolic equations in dimension one has been treated in [4].

Our results below indicate that the injectivity of $a^M \to u^N(a^M)$ depends
upon certain rank conditions that imply compatibility conditions on
the spaces used to approximate the coefficient $a$ and the solution $u(a)$.

It will be seen that $a$ may be identifiable under approximation without
the known sufficient conditions for identifiability of $a$ in (1.1) being
satisfied [9]. The results here, although depending on the choice of Neumann
boundary conditions, can easily be adapted to different boundary conditions.

In section 2 we formulate the discrete problems and give general conditions
for identifiability under approximation. In section 3 we examine
several concrete examples and obtain necessary and sufficient conditions
for identifiability under approximation for these cases. Identifiability
will be guaranteed if there is a sufficient amount of movement in the
observations. On the other hand, if the coefficient is assumed to be
known at points where the observations are stationary, then it can still
be identifiable on the remaining parts of the domain (0,1). Section 4 is
devoted to the problem of continuous dependence of the solution of the
discretized version of (1.3) or (1.4) on the observation $z$ and $Q_{ad}$. Sufficient
conditions for output least squares identifiability (OLSI) [2] and output
least squares stability (OLSS) [3] are given. Finally, in section 5 we
report the findings of a numerical experiment that supports the practical
relevance of our results.


2.  Basic  Results 


To approximate (1.2) by the standard finite element method [10],
let $\{B_i\}_{i=0}^N$ and $\{\phi_j\}_{j=1}^M$ be sets of linearly independent functions
defined on $I$ with $B_i \in H^1(I)$ and $\phi_j$ piecewise continuous. Let
$A^M = \mathrm{span}\{\phi_j : j = 1,\dots,M\}$ and $H^N = \mathrm{span}\{B_i : i = 0,\dots,N\}$. Setting

$u^N = \sum_{i=0}^N \mu_i B_i$  and  $a^M = \sum_{j=1}^M a_j^M \phi_j$

we have upon integration by parts of (1.1) with $u$ replaced by $u^N$:

$\sum_{i=0}^N \mu_i \langle a B_{i,x}, B_{k,x}\rangle + \sum_{i=0}^N \mu_i \langle c B_i, B_k\rangle = \langle f, B_k\rangle$  for $k = 0,\dots,N$,

where $\langle\cdot,\cdot\rangle$ denotes the inner product in $L^2$. Replacing $a$ by $a^M$ it
follows that

$\sum_{i=0}^N \mu_i \sum_{j=1}^M a_j^M \langle \phi_j B_{i,x}, B_{k,x}\rangle + \sum_{i=0}^N \mu_i \langle c B_i, B_k\rangle = \langle f, B_k\rangle$

for $k = 0,\dots,N$.

Rearranging the summations in this last expression we arrive at

(2.1)  $\sum_{j=1}^M a_j^M \sum_{i=0}^N \langle \phi_j B_{i,x}, B_{k,x}\rangle \mu_i + \sum_{i=0}^N \langle c B_i, B_k\rangle \mu_i = \langle f, B_k\rangle$

for $k = 0,\dots,N$.


We now make the following definitions: $H_j$ and $K$ are $(N+1) \times (N+1)$
matrices with the $(i,k)$-th elements given by

$(H_j)_{i,k} = \langle \phi_j B_{i,x}, B_{k,x}\rangle$  and  $(K)_{i,k} = \langle c B_i, B_k\rangle$

for $i,k = 0,\dots,N$; $j = 1,\dots,M$. Similarly $\tilde f \in \mathbb{R}^{N+1}$, $\tilde\mu \in \mathbb{R}^{N+1}$ and
$\tilde a^M \in \mathbb{R}^M$ are given by

$(\tilde f)_k = \langle f, B_k\rangle$,  $(\tilde\mu)_i = \mu_i$  and  $(\tilde a^M)_j = a_j^M$.

With this notation (2.1) becomes

(2.2)  $\sum_{j=1}^M a_j^M H_j \tilde\mu + K\tilde\mu = \tilde f$,

where we used the symmetry of $H_j$ and $K$. Thus we obtain a mapping
$\tilde a^M \to \tilde\mu(\tilde a^M)$ from $\mathbb{R}^M$ into $\mathbb{R}^{N+1}$ or, equivalently, $a^M \to u^N(a^M)$,
$a^M = \sum_{j=1}^M a_j^M \phi_j$, from $A^M$ to $H^N$, that is well defined as long as
$\sum_{j=1}^M a_j^M H_j + K$ is invertible. For example, if $c \ge 0$ and $\{B_{i,x}\}_{i=0}^N$ are
linearly independent, then $\sum_{j=1}^M a_j^M H_j + K$ is invertible for all

$\tilde a^M \in \tilde A = \{\tilde a^M \in \mathbb{R}^M : \sum_{j=1}^M (\tilde a^M)_j \phi_j > 0 \text{ on } I\}$.

Similarly, if $c \ge \zeta > 0$ as assumed throughout, then again $\sum_{j=1}^M a_j^M H_j + K$
is invertible for all $\tilde a^M \in \tilde A$.

We now define identifiability of $\tilde a^M = \mathrm{col}(a_1^M,\dots,a_M^M)$ in (2.2).

Definition 2.1. The parameter $\tilde a^M \in \tilde A$ in (2.2) is called identifiable
if $\tilde\mu(\tilde a^M) = \tilde\mu(\tilde b^M)$ implies $\tilde a^M = \tilde b^M$ for all $\tilde b^M \in \tilde A$.

For a specific choice of approximation of $a$ in (1.1) by $a^M$, we
say that $a$ is identifiable under approximation at $a^M$ if $\tilde a^M$ is
identifiable.

Theorem 2.1. Let $\{B_i\}_{i=0}^N$ and $\{\phi_j\}_{j=1}^M$ be linearly independent,
$c(x) \ge \zeta > 0$ and $\tilde a^M \in \tilde A$. Then $\tilde a^M \in \tilde A$ is identifiable if and only if
the vectors $\{H_j\tilde\mu(\tilde a^M)\}_{j=1}^M$ are linearly independent.

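Proof. Using (2.2), linear independence of $\{H_j\tilde\mu(\tilde a^M)\}_{j=1}^M$ clearly implies
identifiability of $\tilde a^M$: if $\tilde\mu(\tilde a^M) = \tilde\mu(\tilde b^M)$, then subtracting the two
instances of (2.2) gives $\sum_{j=1}^M (a_j^M - b_j^M) H_j\tilde\mu(\tilde a^M) = 0$ and hence $\tilde a^M = \tilde b^M$.
Conversely assume that there exists a nontrivial
vector $(\alpha_1,\dots,\alpha_M) \in \mathbb{R}^M$ with

(2.3)  $\sum_{j=1}^M \alpha_j H_j \tilde\mu(\tilde a^M) = 0$.

Then $\sum_{j=1}^M (a_j^M - \varepsilon\alpha_j)\phi_j > 0$ for some sufficiently small $\varepsilon > 0$, and $\tilde a_1^M$
given by $(\tilde a_1^M)_j = a_j^M - \varepsilon\alpha_j$ satisfies $\tilde a_1^M \in \tilde A$. Multiplying (2.3) by $\varepsilon$
and subtracting it from (2.2) we find that $\tilde\mu(\tilde a^M) = \tilde\mu(\tilde a_1^M)$. This ends
the proof.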
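The rank criterion of Theorem 2.1 is easy to check numerically. The following is a minimal sketch, not the authors' code. It assumes the linear spline and piecewise constant discretization that section 3 introduces as Case 1, takes $c \equiv 1$, and manufactures the load vector from a prescribed state; the helper names `case1_matrices`, `system_matrix`, `solve_state` and `is_identifiable` are hypothetical. It also illustrates the construction in the proof: when $H_j\tilde\mu = 0$ for some $j$, changing $a_j^M$ leaves the state unchanged.

```python
import numpy as np

def case1_matrices(n):
    """Return ([H_1, ..., H_n], K) for hat functions B_i on n equal cells,
    a coefficient that is constant on each cell, and c = 1 (assumption)."""
    stiff = n * np.array([[1.0, -1.0], [-1.0, 1.0]])        # <B_i', B_k'> on one cell
    mass = np.array([[2.0, 1.0], [1.0, 2.0]]) / (6.0 * n)   # <B_i, B_k> on one cell
    H_list, K = [], np.zeros((n + 1, n + 1))
    for j in range(1, n + 1):
        H = np.zeros((n + 1, n + 1))
        H[j - 1:j + 1, j - 1:j + 1] = stiff
        K[j - 1:j + 1, j - 1:j + 1] += mass
        H_list.append(H)
    return H_list, K

def system_matrix(a, H_list, K):
    """Left-hand side matrix of (2.2): sum_j a_j H_j + K."""
    return sum(aj * Hj for aj, Hj in zip(a, H_list)) + K

def solve_state(a, H_list, K, f):
    """Solve (2.2) for the coefficient vector mu of u^N."""
    return np.linalg.solve(system_matrix(a, H_list, K), f)

def is_identifiable(mu, H_list, tol=1e-10):
    """Theorem 2.1: identifiable iff the vectors H_j mu are linearly independent."""
    G = np.column_stack([H @ mu for H in H_list])
    return np.linalg.matrix_rank(G, tol=tol) == len(H_list)

H_list, K = case1_matrices(4)
a = np.array([1.0, 2.0, 3.0, 4.0])

# State whose nodal values all differ from their neighbours.
mu_moving = np.array([0.0, 1.0, 3.0, 6.0, 10.0])
f_moving = system_matrix(a, H_list, K) @ mu_moving   # manufactured load vector
ident_moving = is_identifiable(solve_state(a, H_list, K, f_moving), H_list)

# State that is stationary on the second cell (mu_1 = mu_2).
mu_flat = np.array([0.0, 1.0, 1.0, 2.0, 3.0])
f_flat = system_matrix(a, H_list, K) @ mu_flat
ident_flat = is_identifiable(solve_state(a, H_list, K, f_flat), H_list)

# Changing a_2 does not change the state, as in the proof of Theorem 2.1.
b = a.copy()
b[1] = 7.0
same_state = np.allclose(solve_state(b, H_list, K, f_flat), mu_flat)
```

In this sketch `ident_moving` is true, `ident_flat` is false, and `same_state` confirms that two different coefficient vectors produce the same discrete solution.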

Since $H_j: \mathbb{R}^{N+1} \to \mathbb{R}^{N+1}$, for $j = 1,\dots,M$, we have the following:

Corollary 2.1. If $M > N+1$, then $\tilde a^M$ in (2.2) is not identifiable.

Corollary 2.2. If $\tilde\mu(\tilde a^M) \in \mathrm{Ker}(H_j)$ for some $j = 1,\dots,M$, then $\tilde a^M$
is not identifiable.

Proof. If $\tilde\mu(\tilde a^M) \in \mathrm{Ker}(H_j)$ then the set $\{H_j\tilde\mu(\tilde a^M)\}_{j=1}^M$ is linearly
dependent and the result follows from Theorem 2.1.

Remark 2.1. Corollary 2.2 should be compared with the condition
$|u_x| \ge k_1 > 0$, which is known to be a sufficient condition for
identifiability of $a$ in the infinite dimensional problem (1.1) [9].

Definition 2.2. The coordinates $\{a_{j_k}^M\}_{k=1}^{\hat M}$, $\hat M \le M$, of the parameter vector
$\tilde a^M \in \tilde A$ are called identifiable if $\tilde\mu(\tilde a^M) = \tilde\mu(\tilde b^M)$ and $a_j^M = b_j^M$ for
all $j \ne j_k$, $k = 1,\dots,\hat M$, imply $\tilde a^M = \tilde b^M$, for all $\tilde b^M \in \tilde A$.

Proposition 2.1. The coordinates $\{a_{j_k}^M\}_{k=1}^{\hat M}$ of $\tilde a^M \in \tilde A$ are identifiable
if and only if the vectors $\{H_{j_k}\tilde\mu(\tilde a^M)\}_{k=1}^{\hat M}$ are linearly independent.

The proof is obvious from that of Theorem 2.1.


3.  Several  Examples 


In  this  section  we  consider  several  concrete  examples  and  determine 
their  identifiability  properties.  We  point  out  that  here  we  use  N  to 
denote  the  number  of  subintervals  of  I  and  N  and  M  of  the  previous 
section  are  a  function  of  this  N. 

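Case 1. Let $I$ be partitioned into $N$ subintervals of length $1/N$.

For $i = 0,\dots,N$ define the linear spline basis functions

(3.1)  $B_i(x) = \begin{cases} Nx - i + 1, & \frac{i-1}{N} \le x \le \frac{i}{N}, \\ -Nx + i + 1, & \frac{i}{N} \le x \le \frac{i+1}{N}, \\ 0 & \text{otherwise}, \end{cases}$

and for $j = 1,\dots,N$ the 0-th order splines

(3.2)  $\phi_j(x) = \begin{cases} 1, & \frac{j-1}{N} \le x \le \frac{j}{N}, \\ 0 & \text{otherwise}. \end{cases}$

Thus $N$ and $M$ of the previous section are both $N$ here.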

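We approximate the solution $u$ of (1.1) by linear splines and
the coefficient $a$ by constant splines. It is straightforward to compute
the $(N+1) \times (N+1)$ matrices $H_j$, $j = 1,\dots,N$:

$H_j = N \begin{pmatrix} 0 & & & \\ & 1 & -1 & \\ & -1 & 1 & \\ & & & 0 \end{pmatrix}$,

where the first entry of the nontrivial submatrix is in the $(j-1,j-1)$ position of $H_j$.

Let $\tilde\mu = \tilde\mu(\tilde a^M) = \mathrm{col}(\mu_0,\dots,\mu_N)$. Now set $\delta_j = \mu_j - \mu_{j-1}$, for $j = 1,\dots,N$.
To study the linear independence of the vectors $\{H_j\tilde\mu\}_{j=1}^N$, note that

$(H_1\tilde\mu,\dots,H_N\tilde\mu) = NB$,

where $B$ is the $(N+1) \times N$ matrix

$B = \begin{pmatrix} -\delta_1 & & & \\ \delta_1 & -\delta_2 & & \\ & \delta_2 & \ddots & \\ & & \ddots & -\delta_N \\ & & & \delta_N \end{pmatrix}$.

Lemma 3.1. The vectors $\{H_j\tilde\mu\}_{j=1}^N$ are linearly independent if and
only if $\delta_i \ne 0$ for all $i = 1,\dots,N$.

Proof. It is easily shown that $B$ is row equivalent [8] to

$\begin{pmatrix} I_N \\ 0 \end{pmatrix}$

provided $\delta_i \ne 0$ for all $i$, and $\{H_j\tilde\mu\}_{j=1}^N$ are linearly independent
in this case. Conversely, if $\delta_i = 0$ for some $i$, then the column rank
of $B$ is less than $N$ and linear dependence of $\{H_j\tilde\mu\}_{j=1}^N$ follows.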

Theorem 3.1. In Case 1, $\tilde a^M$ is identifiable if and only if $\mu_i(\tilde a^M) \ne \mu_{i-1}(\tilde a^M)$
for all $i = 1,\dots,N$.

Proposition 3.1. The coordinates $\{a_{j_k}^M\}_{k=1}^{\hat M}$ of $\tilde a^M$ are identifiable if
and only if $\mu_{j_k}(\tilde a^M) \ne \mu_{j_k - 1}(\tilde a^M)$ for all $k = 1,\dots,\hat M$.

The interpretation of this result is that the parameter $\tilde a^M$ can only be
identified at coordinates where the corresponding observations are
nonstationary.


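Case 2. Let $N$ be even and let $I$ be partitioned into $N$ subintervals
of length $1/N$. The functions $B_i$, $i = 0,\dots,N$, are taken as in (3.1).
Here, however, we define the functions $\phi_j$ for $j = 1,\dots,N/2$ by

$\phi_j(x) = \begin{cases} 1, & \frac{2(j-1)}{N} \le x \le \frac{2j}{N}, \\ 0 & \text{otherwise}. \end{cases}$

Thus $M$ of section 2 is $N/2$ now. We find in this case that the $(N+1) \times (N+1)$
matrices $H_j$, for $j = 1,\dots,N/2$, are given by

$H_j = N \begin{pmatrix} 0 & & & & \\ & 1 & -1 & 0 & \\ & -1 & 2 & -1 & \\ & 0 & -1 & 1 & \\ & & & & 0 \end{pmatrix}$,

where the first entry of the nontrivial submatrix is in the $(2j-2, 2j-2)$
position of $H_j$. With $\tilde\mu = \tilde\mu(\tilde a^M) = \mathrm{col}(\mu_0,\dots,\mu_N)$ as before we have

$H_j\tilde\mu = N\,\mathrm{col}(0,\dots,0,\ \mu_{2j-2} - \mu_{2j-1},\ -\mu_{2j-2} + 2\mu_{2j-1} - \mu_{2j},\ -\mu_{2j-1} + \mu_{2j},\ 0,\dots,0)$,

where the first nonzero entry occurs in the $2j-2$ coordinate. Setting
$\delta_i = \mu_i - \mu_{i-1}$ for $i = 1,\dots,N$ we thus have

$H_j\tilde\mu = N\,\mathrm{col}(0,\dots,0,\ -\delta_{2j-1},\ \delta_{2j-1} - \delta_{2j},\ \delta_{2j},\ 0,\dots,0)$

for $j = 1,\dots,N/2$. To investigate the linear independence of $\{H_j\tilde\mu\}_{j=1}^{N/2}$
note that

$(H_1\tilde\mu,\dots,H_{N/2}\tilde\mu) = NB$,

where the $(N+1) \times \frac{N}{2}$ matrix $B$ has the columns
$b_j = \mathrm{col}(0,\dots,0,\ -\delta_{2j-1},\ \delta_{2j-1} - \delta_{2j},\ \delta_{2j},\ 0,\dots,0)$, $j = 1,\dots,N/2$,
and the collection of vectors $\{H_j\tilde\mu\}_{j=1}^{N/2}$ is linearly independent if and
only if $\mathrm{rank}(B) = N/2$.

Lemma 3.2. In Case 2 the vectors $\{H_j\tilde\mu\}_{j=1}^{N/2}$ are linearly independent
if and only if $\delta_{2i-1} \ne 0$ or $\delta_{2i} \ne 0$ for $i = 1,\dots,N/2$.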

Proof.  The  matrix  B  is  row  equivalent  to 


From  this  and  the  fact  that  the  dimension  of  the  column  space  of  a  matrix 
is  equal  to  the  rank  of  that  matrix,  the  result  follows. 

Theorem 3.2. In Case 2, $\tilde a^M$ is identifiable if and only if $\mu_{2i-1} \ne \mu_{2i-2}$
or $\mu_{2i} \ne \mu_{2i-1}$ for $i = 1,\dots,N/2$.
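The equivalence in Theorem 3.2 can be spot-checked numerically. The sketch below is illustrative only: it assembles the Case 2 matrices from the one-cell stiffness contributions and compares the rank criterion of Theorem 2.1 with the condition on adjacent differences. The function names `case2_matrices`, `rank_criterion` and `difference_criterion`, the mesh size and the two test vectors are assumptions made for this example.

```python
import numpy as np

def case2_matrices(n):
    """H_j, j = 1..n/2, for hat functions on n equal cells and a coefficient
    that is constant on each pair of cells (2j-1, 2j)."""
    if n % 2:
        raise ValueError("n must be even")
    stiff = n * np.array([[1.0, -1.0], [-1.0, 1.0]])   # one-cell contribution
    mats = []
    for j in range(1, n // 2 + 1):
        H = np.zeros((n + 1, n + 1))
        for cell in (2 * j - 1, 2 * j):
            H[cell - 1:cell + 1, cell - 1:cell + 1] += stiff
        mats.append(H)
    return mats

def rank_criterion(mu, mats, tol=1e-10):
    """Theorem 2.1: the vectors H_j mu are linearly independent."""
    G = np.column_stack([H @ mu for H in mats])
    return bool(np.linalg.matrix_rank(G, tol=tol) == len(mats))

def difference_criterion(mu):
    """Theorem 3.2: delta_{2i-1} != 0 or delta_{2i} != 0 for every i."""
    d = np.diff(mu)              # d[i-1] = delta_i = mu_i - mu_{i-1}
    pairs = d.reshape(-1, 2)     # row i-1 holds (delta_{2i-1}, delta_{2i})
    return bool(np.all(np.any(pairs != 0.0, axis=1)))

mats = case2_matrices(6)
# One nonzero difference in every pair of cells.
mu_ok = np.array([0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 3.0])
# Both differences vanish on the second pair of cells.
mu_bad = np.array([0.0, 1.0, 2.0, 2.0, 2.0, 3.0, 4.0])
```

For `mu_ok` both criteria hold, while for `mu_bad` the second column of $B$ vanishes and both criteria fail.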

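Case 3. Let $I$ be partitioned into $N$ subintervals of length $1/N$.
Again we take the functions $B_i$, $i = 0,\dots,N$, to be those defined in
(3.1). Further we set $\phi_i = B_i$, $i = 0,\dots,N$. Thus $M$ of section 2 is
$N+1$ here and both $\phi_j$ and $B_i$ are linear splines defined on the same
mesh. In this case the structure of the $(N+1) \times (N+1)$ matrices $H_j$
is slightly more complicated than in Cases 1 and 2. These matrices are
now given as follows:

$H_0 = \frac{N}{2} \begin{pmatrix} 1 & -1 & \\ -1 & 1 & \\ & & 0 \end{pmatrix}$

with the first entry in the (0,0)-element. For $j = 1,\dots,N-1$ we have

$H_j = \frac{N}{2} \begin{pmatrix} 0 & & & & \\ & 1 & -1 & 0 & \\ & -1 & 2 & -1 & \\ & 0 & -1 & 1 & \\ & & & & 0 \end{pmatrix}$,

where the first entry of the nontrivial submatrix appears in the $(j-1, j-1)$
position of $H_j$. Finally

$H_N = \frac{N}{2} \begin{pmatrix} 0 & & \\ & 1 & -1 \\ & -1 & 1 \end{pmatrix}$,

where the first nonzero entry occurs in the $(N-1, N-1)$ element. The
vectors $H_j\tilde\mu$ are given by

$H_0\tilde\mu = \frac{N}{2}\,\mathrm{col}(\mu_0 - \mu_1,\ -\mu_0 + \mu_1,\ 0,\dots,0)$,

$H_j\tilde\mu = \frac{N}{2}\,\mathrm{col}(0,\dots,0,\ \mu_{j-1} - \mu_j,\ -\mu_{j-1} + 2\mu_j - \mu_{j+1},\ -\mu_j + \mu_{j+1},\ 0,\dots,0)$,

$H_N\tilde\mu = \frac{N}{2}\,\mathrm{col}(0,\dots,0,\ \mu_{N-1} - \mu_N,\ -\mu_{N-1} + \mu_N)$,

for $j = 1,\dots,N-1$. To investigate the linear independence of $\{H_j\tilde\mu\}_{j=0}^N$
note that

$(H_0\tilde\mu,\dots,H_N\tilde\mu) = \frac{N}{2} B$,

where, with $\delta_i = \mu_i - \mu_{i-1}$ for $i = 1,\dots,N$, the $(N+1) \times (N+1)$ matrix $B$ has the columns
$b_0 = \mathrm{col}(-\delta_1,\ \delta_1,\ 0,\dots,0)$,
$b_j = \mathrm{col}(0,\dots,0,\ -\delta_j,\ \delta_j - \delta_{j+1},\ \delta_{j+1},\ 0,\dots,0)$ for $j = 1,\dots,N-1$, and
$b_N = \mathrm{col}(0,\dots,0,\ -\delta_N,\ \delta_N)$.

Performing row operations on $B$ (the entries of each column of $B$ sum to zero,
so adding all other rows to the last row produces a zero row)
we then see that $B$ has rank less than or equal to $N$ and therefore we have

Theorem 3.3. In Case 3, $\tilde a^M \in \tilde A$ is not identifiable.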

If one decreases the number of $\phi_j$'s in Case 3 then it is reasonable
to expect that sufficient and necessary conditions for the
identifiability of $\tilde a^M$ in the spirit of Cases 1 and 2 can be obtained. We
verify this next for a particular choice of $N$ and $M$. Moreover, in
Section 5 we present a numerical experiment which tends to support this
contention.
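The rank deficiency behind Theorem 3.3 can be observed directly. The following sketch is an illustration under stated assumptions rather than part of the original analysis: the helper `hat_coefficient_matrices` is hypothetical and assembles $H_j = (\langle \phi_j B_{i,x}, B_{k,x}\rangle)$ for hat functions $\phi_j$ by the midpoint rule, which is exact here because the integrand is linear on each state cell. With the same mesh for state and coefficient, the $N+1$ vectors $H_j\tilde\mu$ all have entries summing to zero, so their rank cannot exceed $N$.

```python
import numpy as np

def hat_coefficient_matrices(n_state, n_coef):
    """H_j, j = 0..n_coef, for hat functions B_i on n_state equal cells and
    hat functions phi_j on n_coef equal cells. Assumes n_state is a multiple
    of n_coef, so that phi_j is linear on every state cell."""
    if n_state % n_coef:
        raise ValueError("n_state must be a multiple of n_coef")
    pattern = np.array([[1.0, -1.0], [-1.0, 1.0]])
    mats = [np.zeros((n_state + 1, n_state + 1)) for _ in range(n_coef + 1)]
    for cell in range(1, n_state + 1):
        mid = (cell - 0.5) / n_state                        # cell midpoint
        for j in range(n_coef + 1):
            phi_mid = max(0.0, 1.0 - abs(n_coef * mid - j))  # phi_j at the midpoint
            # (B')^2 = n_state^2 on the cell, cell length 1/n_state
            mats[j][cell - 1:cell + 1, cell - 1:cell + 1] += n_state * phi_mid * pattern
    return mats

n = 6
mats = hat_coefficient_matrices(n, n)          # Case 3: same mesh for a and u
mu = (np.arange(n + 1) / n) ** 2               # sample state, all differences nonzero
G = np.column_stack([H @ mu for H in mats])    # (H_0 mu, ..., H_N mu)
rank = np.linalg.matrix_rank(G, tol=1e-10)
column_sums = np.ones(n + 1) @ G
```

For this sample state the rank equals $N = 6$ although there are $N + 1 = 7$ columns, so the vectors are linearly dependent however much the state varies.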


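Case 4. Let $I$ be partitioned into $2N$ subintervals of length $1/2N$.
We choose the functions $B_i$ for $i = 0,\dots,2N$ as

$B_i(x) = \begin{cases} 2Nx - i + 1, & \frac{i-1}{2N} \le x \le \frac{i}{2N}, \\ -2Nx + i + 1, & \frac{i}{2N} \le x \le \frac{i+1}{2N}, \\ 0 & \text{otherwise}. \end{cases}$

For the functions $\phi_j$ we take

$\phi_j(x) = \begin{cases} Nx - j + 1, & \frac{j-1}{N} \le x \le \frac{j}{N}, \\ -Nx + j + 1, & \frac{j}{N} \le x \le \frac{j+1}{N}, \\ 0 & \text{otherwise}, \end{cases}$

for $j = 0,\dots,N$; thus $M$ of section 2 is $N+1$. The $(2N+1) \times (2N+1)$
matrices $H_j$ are given as follows:

$H_0 = \frac{N}{2} \begin{pmatrix} 3 & -3 & 0 & \\ -3 & 4 & -1 & \\ 0 & -1 & 1 & \\ & & & 0 \end{pmatrix}$

with 3 in the (0,0) element,

$H_j = \frac{N}{2} \begin{pmatrix} 0 & & & & & & \\ & 1 & -1 & 0 & 0 & 0 & \\ & -1 & 4 & -3 & 0 & 0 & \\ & 0 & -3 & 6 & -3 & 0 & \\ & 0 & 0 & -3 & 4 & -1 & \\ & 0 & 0 & 0 & -1 & 1 & \\ & & & & & & 0 \end{pmatrix}$ for $j = 1,\dots,N-1$,

where the first entry of the nontrivial submatrix occurs in the $(2(j-1), 2(j-1))$-element, and

$H_N = \frac{N}{2} \begin{pmatrix} 0 & & & \\ & 1 & -1 & 0 \\ & -1 & 4 & -3 \\ & 0 & -3 & 3 \end{pmatrix}$.

Let $\tilde\mu = \tilde\mu(\tilde a^M) = \mathrm{col}(\mu_0,\dots,\mu_{2N})$. Then we see that

$H_0\tilde\mu = \frac{N}{2}\,\mathrm{col}(3\mu_0 - 3\mu_1,\ -3\mu_0 + 4\mu_1 - \mu_2,\ -\mu_1 + \mu_2,\ 0,\dots,0)$,

$H_N\tilde\mu = \frac{N}{2}\,\mathrm{col}(0,\dots,0,\ \mu_{2N-2} - \mu_{2N-1},\ -\mu_{2N-2} + 4\mu_{2N-1} - 3\mu_{2N},\ -3\mu_{2N-1} + 3\mu_{2N})$,

and for $j = 1,\dots,N-1$

$H_j\tilde\mu = \frac{N}{2}\,\mathrm{col}(0,\dots,0,\ \mu_{2j-2} - \mu_{2j-1},\ -\mu_{2j-2} + 4\mu_{2j-1} - 3\mu_{2j},\ -3\mu_{2j-1} + 6\mu_{2j} - 3\mu_{2j+1},\ -3\mu_{2j} + 4\mu_{2j+1} - \mu_{2j+2},\ -\mu_{2j+1} + \mu_{2j+2},\ 0,\dots,0)$,

where the first nonzero entry occurs in row $2(j-1)$. To investigate
the linear independence of $\{H_j\tilde\mu\}_{j=0}^N$ we put $\delta_i = \mu_i - \mu_{i-1}$ for
$i = 1,\dots,2N$ and observe that

$(H_0\tilde\mu,\dots,H_N\tilde\mu) = \frac{N}{2} B$,

where the $(2N+1) \times (N+1)$ matrix $B$ has the columns
$b_0 = \mathrm{col}(-3\delta_1,\ 3\delta_1 - \delta_2,\ \delta_2,\ 0,\dots,0)$,
$b_j = \mathrm{col}(0,\dots,0,\ -\delta_{2j-1},\ \delta_{2j-1} - 3\delta_{2j},\ 3\delta_{2j} - 3\delta_{2j+1},\ 3\delta_{2j+1} - \delta_{2j+2},\ \delta_{2j+2},\ 0,\dots,0)$ for $j = 1,\dots,N-1$, and
$b_N = \mathrm{col}(0,\dots,0,\ -\delta_{2N-1},\ \delta_{2N-1} - 3\delta_{2N},\ 3\delta_{2N})$.

Performing row operations on $B$ we obtain an equivalent matrix $\tilde B$
which is of dimension $2N \times (N+1)$. The vectors $\{H_j\tilde\mu\}_{j=0}^N$ are linearly
independent if $\mathrm{rank}(\tilde B) = N+1$.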

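Theorem 3.4. In Case 4, if $\mu_{2i-1} \ne \mu_{2i-2}$ or $\mu_{2i} \ne \mu_{2i-1}$ for all
$i = 1,\dots,N$ and $(\mu_{2i-1} - \mu_{2i-2})(\mu_{2i} - \mu_{2i-1}) \ne 0$ for some $i$,
where $\tilde\mu = \tilde\mu(\tilde a^M)$, then $\tilde a^M \in \tilde A$ is identifiable.

Proof. Let $\tilde\alpha = \mathrm{col}(\alpha_0,\dots,\alpha_N)$ satisfy $\tilde B\tilde\alpha = 0$ and let $i$ be such
that $\delta_{2i-1}\cdot\delta_{2i} \ne 0$. Then $\alpha_{i-1} = \alpha_i = 0$. Further $\alpha_i = 0$ for all
other $i$, since $\delta_{2i-1} \ne 0$ or $\delta_{2i} \ne 0$. This implies linear independence
of the columns of $\tilde B$ and thus of $B$. Theorem 2.1 then implies the result.

Theorem 3.5. In Case 4, if $\mu_1 - \mu_0 = \mu_2 - \mu_1 = 0$ or $\mu_{2N-1} - \mu_{2N-2} =
\mu_{2N} - \mu_{2N-1} = 0$ or $\mu_{2i-1} - \mu_{2i-2} = \mu_{2i} - \mu_{2i-1} = \mu_{2i+1} - \mu_{2i} = \mu_{2i+2} -
\mu_{2i+1} = 0$ for some $i = 2,\dots,N-2$, where $\tilde\mu = \tilde\mu(\tilde a^M)$, then $\tilde a^M \in \tilde A$ is not
identifiable.

Proof. Under the assumptions of the theorem the column rank of $B$ is
not maximal and this implies the result.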

Remark 3.1. Four consecutive zeros in the $\delta_i$'s do not necessarily
imply nonidentifiability, provided the zeros start with an even index
and are not at the "beginning" or "end" of the sequence $\{\delta_i\}_{i=1}^{2N}$; in
particular $\delta_{2i} = \delta_{2i+1} = \delta_{2i+2} = \delta_{2i+3} = 0$ with $2 \le i \le N-2$ does
not imply nonidentifiability. For example let $N = 5$, $\delta_1 = \delta_2 = \delta_3 =
\delta_8 = \delta_9 = \delta_{10} = 1$ and $\delta_4 = \delta_5 = \delta_6 = \delta_7 = 0$. Then $\{H_j\tilde\mu\}$, $j = 0,\dots,N$,
with $\delta_i = \mu_i - \mu_{i-1}$, are linearly independent. Choosing $\mu_0$ and $\tilde a^M$,
we can thus calculate $\mu_i$, $i = 1,\dots,2N$, and $\tilde f$ such that $\tilde a^M$ is
identifiable and $\delta_{2i} = \dots = \delta_{2i+3} = 0$.
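The example of Remark 3.1 ($N = 5$, zeros in $\delta_4,\dots,\delta_7$) can be reproduced with a few lines of code. This is a hedged sketch: the helper `hat_coefficient_matrices` is hypothetical, $\mu_0 = 0$ is an arbitrary choice, and the second difference vector (zeros in $\delta_3,\dots,\delta_6$, i.e. starting at an odd index) is added here only to contrast the remark with the situation of Theorem 3.5.

```python
import numpy as np

def hat_coefficient_matrices(n_state, n_coef):
    """H_j, j = 0..n_coef, for hat functions B_i on n_state equal cells and
    hat functions phi_j on n_coef equal cells (n_state a multiple of n_coef).
    The midpoint rule is exact since phi_j is linear on every state cell."""
    if n_state % n_coef:
        raise ValueError("n_state must be a multiple of n_coef")
    pattern = np.array([[1.0, -1.0], [-1.0, 1.0]])
    mats = [np.zeros((n_state + 1, n_state + 1)) for _ in range(n_coef + 1)]
    for cell in range(1, n_state + 1):
        mid = (cell - 0.5) / n_state
        for j in range(n_coef + 1):
            phi_mid = max(0.0, 1.0 - abs(n_coef * mid - j))
            mats[j][cell - 1:cell + 1, cell - 1:cell + 1] += n_state * phi_mid * pattern
    return mats

def rank_of_columns(delta, mats, tol=1e-10):
    """Rank of (H_0 mu, ..., H_N mu) for the state with differences delta."""
    mu = np.concatenate(([0.0], np.cumsum(delta)))   # mu_0 = 0 (arbitrary choice)
    G = np.column_stack([H @ mu for H in mats])
    return np.linalg.matrix_rank(G, tol=tol)

N = 5
mats = hat_coefficient_matrices(2 * N, N)            # Case 4 with N = 5

# Remark 3.1: delta_4 = ... = delta_7 = 0, all other differences equal to 1.
delta_even = np.array([1, 1, 1, 0, 0, 0, 0, 1, 1, 1], dtype=float)
rank_even = rank_of_columns(delta_even, mats)

# Theorem 3.5 with i = 2: delta_3 = ... = delta_6 = 0, so H_2 mu vanishes.
delta_odd = np.array([1, 1, 0, 0, 0, 0, 1, 1, 1, 1], dtype=float)
rank_odd = rank_of_columns(delta_odd, mats)
```

Here `rank_even` equals $N + 1 = 6$, so the six vectors are linearly independent, whereas `rank_odd` is smaller because one column is zero.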
Remark 3.2. The conditions of Theorems 3.1 - 3.4 are conditions on the
variation of adjacent $\mu_i(\tilde a^M)$-values. If this variation is sufficient,
the identifiability of $\tilde a^M$ is guaranteed. The results indicate that
the larger the difference between the dimension of the state space
approximation and the dimension of the parameter space approximation is, the
more likely it is that identifiability of the approximated coefficient
holds. In [9] identifiability of $a$ in (1.1) is studied under various
conditions on the sign of $u_x$ and $u_{xx}$. The most general condition
implying identifiability of $a$ is $\inf(\max(|u_x|, u_{xx})) > 0$. Clearly one
can construct examples where this condition is not met but identifiability
under approximation of $a$ holds, e.g., according to one of the Cases 1 - 4.
4.  Two Stability Concepts

In this section we discuss the application of two concepts of stability
to the finite dimensional output least squares problem

$(P_M^N)$  minimize $|u^N(a^M) - z|_{L^2}^2$ over $C$,

where $C$ is a convex and closed subset of

$Q_{ad}^M = \{a^M = \sum_{j=1}^M a_j^M\phi_j : a_j^M \in \mathbb{R},\ a^M(x) \ge \alpha > 0,\ |a^M|_{H^1} \le \gamma\}$,

and $u^N(a^M) = \sum_{i=0}^N \mu_i B_i$, with $\mathrm{col}(\mu_0,\dots,\mu_N) = \tilde\mu$ satisfying (2.2). The
existence of a minimum $a_*^M$ of $(P_M^N)$ can easily be argued. The reason
for introducing the set $C \subset Q_{ad}^M$ here is that for the first stability
concept, uniqueness of solutions of $(P_M^N)$ is required and this cannot be
guaranteed over all of $Q_{ad}^M$. Here we consider $H^1$ smooth approximating
coefficients; some remarks on $L^\infty$ approximations are given further
below. We investigate the continuous dependence of $a_*^M$ on $z$ and also
on $Q_{ad}^M$ when dealing with the second stability concept. For $C \subset Q_{ad}^M$
let $\mathcal{V}^N(C) = \{u^N(a^M) : a^M \in C\}$ denote the attainable set.


Definition 4.1. [2] The parameter $a^M$ in (2.2) is called output least
squares identifiable (OLSI) by $(P_M^N)$ over $C \subset Q_{ad}^M$, if there exists a
neighborhood $\mathcal{U}$ of $\mathcal{V}^N(C)$ such that for every $z \in \mathcal{U}$ the problem $(P_M^N)$
has a unique solution $a_*^M$ depending continuously on $z$.

Let $A_{inj} \subset Q_{ad}^M$ be such that $\{H_j\tilde\mu(\tilde a^M)\}_{j=1}^M$ is linearly independent
for every $a^M = \sum_{j=1}^M (\tilde a^M)_j\phi_j \in A_{inj}$. As examples for such sets we can
take neighborhoods in $Q_{ad}^M$ of points of identifiability in the sense
of section 2.


Theorem 4.1. Let $A_{inj}$ be as just described. Then $a^M$ in (2.2) is OLSI
by $(P_M^N)$ over every closed convex subset $C$ of $A_{inj}$, provided that
diam $C$ is sufficiently small and $z$ is sufficiently close to $\mathcal{V}^N(C)$.
More precisely, $(P_M^N)$ has a unique solution $a_*^M$ depending Lipschitz
continuously on $z$ as long as $\mathrm{dist}(z, \mathcal{V}^N(C))$ is sufficiently small.

Proof. By ([2], Theorem 4) it suffices to show that $a^M \to u^N(a^M)$ is
twice continuously Fréchet differentiable with $u^N_{a^M}(a^M)$ injective
on $C$. This is equivalent to the existence of continuous first and
second order derivatives of $\tilde a^M \to \tilde\mu(\tilde a^M)$ with $\tilde a^M$ such that
$\sum_{j=1}^M (\tilde a^M)_j\phi_j = a^M \in A_{inj}$, and injectivity of $\tilde\mu_{\tilde a^M}(\tilde a^M)$ for every $a^M \in C$.
Let $\tilde\mu_{\tilde a^M}(\tilde a^M; h) = \eta$ and $\tilde\mu_{\tilde a^M \tilde a^M}(\tilde a^M; h, k) = \xi$ be the first, resp. second,
derivative in directions $h$ and $(h,k)$. Then

(4.1)  $\eta = -L^{-1}\left(\sum_{j=1}^M h_j H_j \tilde\mu(\tilde a^M)\right)$,

(4.2)  $\xi = -L^{-1}\left(\sum_{j=1}^M k_j H_j \eta + \sum_{j=1}^M h_j H_j \tilde\mu_{\tilde a^M}(\tilde a^M; k)\right)$,

where $L = \sum_{j=1}^M a_j^M H_j + K$ and $h = \mathrm{col}(h_1,\dots,h_M)$, $k = \mathrm{col}(k_1,\dots,k_M)$,
and the continuity assumptions follow. The injectivity of $\tilde\mu_{\tilde a^M}(\tilde a^M)$ is
guaranteed by linear independence of $\{H_j\tilde\mu(\tilde a^M)\}$; this ends the proof.
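Formula (4.1) for the first derivative can be compared with a central finite difference. The sketch below is only a sanity check under assumptions that are not in the text: it uses the Case 1 matrices of section 3 with $c \equiv 1$, an arbitrary load vector, an arbitrary direction $h$, and the hypothetical helper names `case1_matrices`, `solve_state` and `first_derivative`.

```python
import numpy as np

def case1_matrices(n):
    """([H_1, ..., H_n], K) for hat functions on n equal cells, a coefficient
    that is constant on each cell, and c = 1 (assumption)."""
    stiff = n * np.array([[1.0, -1.0], [-1.0, 1.0]])
    mass = np.array([[2.0, 1.0], [1.0, 2.0]]) / (6.0 * n)
    H_list, K = [], np.zeros((n + 1, n + 1))
    for j in range(1, n + 1):
        H = np.zeros((n + 1, n + 1))
        H[j - 1:j + 1, j - 1:j + 1] = stiff
        K[j - 1:j + 1, j - 1:j + 1] += mass
        H_list.append(H)
    return H_list, K

def solve_state(a, H_list, K, f):
    """Return L = sum_j a_j H_j + K and the solution mu of (2.2)."""
    L = sum(aj * Hj for aj, Hj in zip(a, H_list)) + K
    return L, np.linalg.solve(L, f)

def first_derivative(a, h, H_list, K, f):
    """eta = -L^{-1} (sum_j h_j H_j mu(a)), i.e. formula (4.1)."""
    L, mu = solve_state(a, H_list, K, f)
    rhs = sum(hj * (Hj @ mu) for hj, Hj in zip(h, H_list))
    return -np.linalg.solve(L, rhs)

H_list, K = case1_matrices(5)
a = np.array([1.0, 1.5, 2.0, 2.5, 3.0])
h = np.array([0.3, -0.2, 0.5, 0.1, -0.4])           # arbitrary direction
f = np.array([0.0, 1.0, -1.0, 2.0, 0.5, 1.0])       # arbitrary load vector

eta = first_derivative(a, h, H_list, K, f)

# Central difference quotient of a -> mu(a) in the direction h.
step = 1e-4
mu_plus = solve_state(a + step * h, H_list, K, f)[1]
mu_minus = solve_state(a - step * h, H_list, K, f)[1]
eta_fd = (mu_plus - mu_minus) / (2.0 * step)
```

The two vectors `eta` and `eta_fd` agree up to the discretization error of the difference quotient, which is consistent with differentiating (2.2) with respect to the coefficient vector.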
To describe the second notion of stability we consider the case
$C = Q_{ad}^M$:

$(P_M^N)_w$  minimize $|u^N(a^M) - z|_{L^2}^2$ over $Q_{ad}^M$.

We study continuous dependence of local solutions of $(P_M^N)_w$ on
$w = (z,\alpha,\gamma) \in W$, where $W = H^0 \times \mathbb{R} \times \mathbb{R}$. Here $W$ is endowed with the
Hilbert-space product norm. We always assume $0 < \alpha < \gamma$, so that $Q_{ad}^M$
is not empty and solutions of (2.2) and $(P_M^N)$ exist.


Definition 4.2. [2] The parameter $a^M$ is called output least squares
(OLS)-stable in $Q_{ad}^M(w^0)$ at the local solution $a_0^M$ of $(P_M^N)_{w^0}$, $w^0 \in W$,
if there exists a neighborhood $V(w^0)$ of $w^0$ in $W$, a neighborhood
$V(a_0^M)$ of $a_0^M$ in $H^1$ and a constant $\kappa$, such that for all $w = (z,\alpha,\gamma) \in
V(w^0)$ there exists a local solution $a_w^M$ of $(P_M^N)_w$ with $a_w^M \in V(a_0^M)$
and for all local solutions $a_w^M \in V(a_0^M)$ of $(P_M^N)_w$ we have

$|a_w^M - a_{w^0}^M|_{H^1} \le \kappa\,|w - w^0|_W^{1/2}$.

Remark 4.1. In comparing OLSI to OLS-stability we observe the following
differences: OLSI requires uniqueness of the solutions of the minimization
problem, whereas for OLS-stability uniqueness is not required, with
continuity being checked at each local solution. If OLSI holds, then
the solutions depend on the observations in a Lipschitz continuous way,
whereas OLS-stability only guarantees Hölder continuous dependence.
Further, OLSI requires continuous dependence of the solutions on the
observation only, whereas OLS-stability involves continuous dependence on
the observations as well as on the admissible set $Q_{ad}$.
Output least squares stability is proved by techniques that guarantee
stability of solutions of abstract optimization problems with respect
to perturbations in the problem data, see [2] and the references
given there. Let $A^M = \mathrm{span}\{\phi_j\}_{j=1}^M$ and let $F(a^M)$ be the Lagrange
functional associated with $(P_M^N)$:

$F(a^M) = |u^N(a^M) - z|^2 - \lambda^* g(a^M)$,

where $\lambda^* \in C^* \times \mathbb{R}$ and
$g: A^M \times W \to C \times \mathbb{R}$ is given by

$g(a^M, w) = (\alpha - a^M,\ |a^M|_{H^1}^2 - \gamma^2)$,  $w = (z,\alpha,\gamma)$.

Note that $a^M \in Q_{ad}^M(w)$ if and only if $g(a^M,w) \in K = C_- \times \mathbb{R}_-$, with $C_-$
and $\mathbb{R}_-$ the natural negative cones in $C(I)$ and $\mathbb{R}$. We shall frequently
drop the index $w$ and write $g(a^M)$ and $Q_{ad}^M$ for $g(a^M,w)$
and $Q_{ad}^M(w)$.

Theorem 4.2. Let $A^M = \mathrm{span}\{\phi_j\}_{j=1}^M$ be such that it contains the constant
functions, let $(z^0,\alpha^0,\gamma^0) = w^0 \in W$ with $0 < \alpha^0 < \gamma^0$ and let
$a_0^M = \sum_{j=1}^M (\tilde a_0^M)_j\phi_j$ be a local solution of $(P_M^N)_{w^0}$. If $\{H_j\tilde\mu(\tilde a_0^M)\}_{j=1}^M$ are
linearly independent vectors in $\mathbb{R}^{N+1}$ and $|u^N(a_0^M) - z^0|$ is sufficiently
small, then $a^M$ is OLS-stable in $Q_{ad}^M(w^0)$ at the local solution $a_0^M$.

For the proof of this theorem the following lemma on the regularity
of the constraint set $Q_{ad}^M$ will be required; its proof is quite similar
to that of Lemma 4.2 in [2] but will be included for the sake of
completeness.


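Lemma 4.1. Let $A^M$ contain the constant functions. Then every $a^M \in
Q_{ad}^M$ is a regular point, i.e., $0 \in \mathrm{int}\{g(a^M) + \mathcal{R}(g_{a^M}(a^M)) - K\} \subset C \times \mathbb{R}$,
where $\mathcal{R}$ denotes the range of the mapping $g_{a^M}(a^M)$.

Proof of Lemma 4.1. We need to show that

(4.3)  $0 \in \mathrm{int}\{g(a^M) + g_{a^M}(a^M)A^M - C_- \times \mathbb{R}_-\}$
$= \mathrm{int}\{(\alpha - a^M - h^M + C_+,\ |a^M|_{H^1}^2 - \gamma^2 + 2\langle a^M,h^M\rangle_{H^1} + \mathbb{R}_+) : h^M \in A^M\}$,

where we used that $g_{a^M}(a^M)h^M = (-h^M,\ 2\langle a^M,h^M\rangle_{H^1})$. Let $(\varphi,r) \in C \times \mathbb{R}$
with $|(\varphi,r)|_{C\times\mathbb{R}} \le \delta$ and $\delta > 0$ to be chosen sufficiently small.
Note that $\varphi - \min\varphi \in C_+$, and $\min\varphi \in A^M$. In view of the first
component in (4.3) we decompose $\varphi$ as

$\varphi = \alpha - a^M - (\alpha - a^M - \min\varphi) + \varphi - \min\varphi$

and therefore $\varphi \in \alpha - a^M - A^M + C_+$. As for the second component in (4.3)
observe that

$|a^M|_{H^1}^2 - \gamma^2 + 2\langle a^M, \alpha - a^M - \min\varphi\rangle_{H^1} = -|a^M|_{H^1}^2 + 2\langle a^M, \alpha - \min\varphi\rangle_{H^1} - \gamma^2$
$\le \alpha^2 - \gamma^2 + 2\delta|a^M|_{H^1}$.

Thus, for $\delta$ sufficiently small one can choose $\tilde r \in \mathbb{R}_+$ such that

$r = |a^M|_{H^1}^2 - \gamma^2 + 2\langle a^M, \alpha - a^M - \min\varphi\rangle_{H^1} + \tilde r$

and, since $(\varphi,r)$ was arbitrary, $a^M$ is shown to be a regular point.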
Proof of Theorem 4.2. We apply results on the stability of abstract
optimization problems as summarized in section 3 of [2]. Due to the
fact that $a^M \to |u^N(a^M) - z^0|^2$ and $a^M \to g(a^M,w^0)$ are twice continuously
differentiable at $a_0^M$ and since the point $a_0^M \in Q_{ad}^M$ is a regular point, it
suffices to establish a lower bound on the second derivative of $F$ at
$a_0^M$. Let $\eta = u^N_{a^M}(a_0^M;h^M)$ and $\xi = u^N_{a^M,a^M}(a_0^M;h^M,h^M)$ for $h^M \in A^M$. Then

$F_{a^M,a^M}(a_0^M;h^M,h^M) = 2\langle u^N(a_0^M) - z^0, \xi\rangle_{H^0} + 2|\eta|_{H^0}^2 - 2\lambda|h^M|_{H^1}^2$
$\ge -2|u^N(a_0^M) - z^0|_{H^0}\,|\xi|_{H^0} + 2|\eta|_{H^0}^2 - 2\lambda|h^M|_{H^1}^2$,

where $\lambda \le 0$ is the Lagrange multiplier associated with the norm
constraint. In view of (4.1), (4.2), the finite dimensionality of $A^M$,
and the linear independence of $\{H_j\tilde\mu(\tilde a_0^M)\}$ it follows that there exist
constants $c_1$ and $c_2$ such that

$F_{a^M,a^M}(a_0^M;h^M,h^M) \ge -c_1|u^N(a_0^M) - z^0|_{H^0}\,|h^M|_{H^1}^2 + c_2|h^M|_{H^1}^2$,

so that for $|u^N(a_0^M) - z^0|$ sufficiently small there exists a constant
$c_3 > 0$ with

$F_{a^M,a^M}(a_0^M;h^M,h^M) \ge c_3|h^M|_{H^1}^2$,

from which the result follows [2; Theorem 3.2, 3.3].


Remark 4.2. If $Q_{ad}^M \subset L^\infty$ only and $|a^M|_{H^1} \le \gamma$ is replaced by
$|a^M|_{L^\infty} \le \gamma$ in the definition of $Q_{ad}^M$, then again one can show existence
of solutions of $(P_M^N)$ and Theorem 4.1 holds with obvious modifications.
The results leading to Theorem 4.2 need yet to be generalized to handle
the nondifferentiable $L^\infty$-norm constraint.


5.  Numerical  Results 


In this section we present some results of a numerical experiment to
estimate the coefficient $a$ in (1.1) given observations $z$ of $u$. To
solve (1.2) with $\mathcal{C} = I$ we consider (2.2), which defines a mapping $\tilde a^M \to
\tilde\mu(\tilde a^M)$ for $\tilde a^M \in \tilde A$, and the finite dimensional minimization problems

(5.1)  minimize $\int_0^1 \left(\sum_{i=0}^N \mu_i(\tilde a^M) B_i - z\right)^2 dx$.

For our experiments we imposed no constraints on $\tilde a^M \in \mathbb{R}^M$, although
$\tilde\mu(\tilde a^M)$ is not well defined for some $\tilde a^M$. The basis functions $\phi_j$ and
$B_i$ were chosen as linear spline functions with equidistant grid on
(0,1). As data $z$ we took the values of a solution of (1.1) by choosing
the coefficient $a$ and the observation $z(x) = u(x) = x^2(1-x)^2$, and
calculating $f = u - (au_x)_x$ from it. Using this $f$ we then compute $\tilde\mu$
from (2.2) as we solve (5.1). For the minimization the Newton-Raphson
algorithm was used.
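A rough sketch of such an experiment is given below for the configuration $N = 10$, NBI $= 5$. It is not the authors' program and makes several simplifying assumptions, all of them choices made here: $c \equiv 1$; the data $z$ is replaced by its nodal values on the state mesh; the load vector is manufactured from the discrete model so that the true coefficient reproduces the data exactly; and a damped Gauss-Newton iteration with step halving replaces the Newton-Raphson algorithm. All function names are hypothetical, and no claim is made that the sketch reproduces the reported graphs.

```python
import numpy as np

def hat_coefficient_matrices(n_state, n_coef):
    """H_j for hat functions on n_state cells (state) and n_coef cells
    (coefficient); n_state must be a multiple of n_coef."""
    if n_state % n_coef:
        raise ValueError("n_state must be a multiple of n_coef")
    pattern = np.array([[1.0, -1.0], [-1.0, 1.0]])
    mats = [np.zeros((n_state + 1, n_state + 1)) for _ in range(n_coef + 1)]
    for cell in range(1, n_state + 1):
        mid = (cell - 0.5) / n_state
        for j in range(n_coef + 1):
            phi_mid = max(0.0, 1.0 - abs(n_coef * mid - j))
            mats[j][cell - 1:cell + 1, cell - 1:cell + 1] += n_state * phi_mid * pattern
    return mats

def mass_matrix(n):
    """K = (<B_i, B_k>) for c = 1 (assumption)."""
    K = np.zeros((n + 1, n + 1))
    for cell in range(1, n + 1):
        K[cell - 1:cell + 1, cell - 1:cell + 1] += np.array([[2.0, 1.0], [1.0, 2.0]]) / (6.0 * n)
    return K

def state_and_jacobian(a, mats, K, f):
    """Solve (2.2) and build the Jacobian column by column from (4.1)."""
    L = sum(aj * Hj for aj, Hj in zip(a, mats)) + K
    mu = np.linalg.solve(L, f)
    J = np.column_stack([-np.linalg.solve(L, Hj @ mu) for Hj in mats])
    return mu, J

def misfit(a, z, mats, K, f):
    """Objective (5.1) with z replaced by its linear spline interpolant."""
    d = state_and_jacobian(a, mats, K, f)[0] - z
    return float(d @ K @ d)

def gauss_newton(z, mats, K, f, a_start, iterations=25):
    R = np.linalg.cholesky(K).T          # K = R^T R, so |R d|^2 = d^T K d
    a = a_start.copy()
    for _ in range(iterations):
        mu, J = state_and_jacobian(a, mats, K, f)
        step = np.linalg.lstsq(R @ J, -(R @ (mu - z)), rcond=None)[0]
        current, t = misfit(a, z, mats, K, f), 1.0
        while t > 1e-8:                  # step halving, keep the coefficient positive
            trial = a + t * step
            if trial.min() > 0.0 and misfit(trial, z, mats, K, f) < current:
                a = trial
                break
            t *= 0.5
        else:
            break                        # no decrease found: stop iterating
    return a

n_state, n_coef = 10, 5
mats, K = hat_coefficient_matrices(n_state, n_coef), mass_matrix(n_state)
x = np.linspace(0.0, 1.0, n_state + 1)
z = x**2 * (1.0 - x)**2                               # nodal values of the data
a_true = 1.0 + np.linspace(0.0, 1.0, n_coef + 1)      # a(x) = 1 + x at the nodes
f = (sum(aj * Hj for aj, Hj in zip(a_true, mats)) + K) @ z   # manufactured load
a_start = np.full(n_coef + 1, 2.0)                    # start-up value 2
a_found = gauss_newton(z, mats, K, f, a_start)
```

By construction the iteration never increases the misfit and keeps the nodal values of the coefficient positive; comparing `a_found` with `a_true` then gives an impression of how well the coefficient is recovered in this idealized setting.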

In our calculations $N = 10$ represents the number of subintervals
used in the linear spline approximation for the solution of (1.1). Thus
the dimension of the approximation space for the solution is 11. Further
NBI is the number of subintervals of $I$ that determine the linear spline
approximation of $a$; the dimension of the approximation space for $a$
is NBI + 1. A necessary condition for identifiability of $\tilde a^M$ is thus
NBI + 1 $\le$ 11, see Corollary 2.1. We show calculations for NBI =
4, 5, 6, 8 - 11, for the choice of $a(x) = 1 + x$. In the first five cases
good results are obtained. Note that NBI = 5 and $N = 10$ is a special
case of Theorem 3.4. In the case NBI = 10, $\tilde a^M$ is not identifiable by
Theorem 3.3. Numerically this is reflected by the appearance of oscillations
as NBI approaches 10 from below, see the graphs for NBI = 9, 10, 11.
The start-up value for the minimization routine was chosen as $a_j^M = 2$.

We point out that a different scaling of the axes in the various graphs
was utilized. We also show the graphs for $u^N(a^M)$, when $N = 10$ and
NBI = 9, 10, 11. The graphs for the approximating solutions for NBI =
4, 5, 6, 8 are indistinguishable from NBI = 9.
FIGURE 6

FIGURE 7